From Linguistics to Ontologies - The Role of Named Entities in the Conceptualisation Process
نویسندگان
چکیده
Ontologies that have been built from texts can be associated with lexical information that is crucial for the semantic annotation of texts and all semantic search tasks. However, the entire pocess of building ontologies from texts cannot be fully automated and it is important to guide the knowledge engineer during the building process. This paper presents an enriched version of TERMINAE, which is a text-based methodology for ontology design. It combines a fact-based approach of modeling with the more traditional concept-centric one. We show that named entities can be used to enrich an existing ontology and to bootstrap the acquisition process. In other words, named entities are used for the conceptualisation of ontologies and not only for their population. This approach is illustrated on two use-cases based on policy documents and evaluated by measuring the Precision and Recall of the resulting ontologies with respect to pre-existing ontologies independently built by
منابع مشابه
Abstract Concepts Through the Lens of Linguistic and Extra-Linguistic Knowledge
The paper deals with the rigorous methods used in the research of concepts representing abstract notions like “friendship”, “love”, “hatred”, “conscience”, and “envy”. Concepts of that kind have no visible physical support in the material world except for the sound forms of the words representing them, thus causing additional difficulties in classification, research and analysis as well as stip...
متن کاملPresenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملA Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features
Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...
متن کاملبهبود شناسایی موجودیتهای نامدار فارسی با استفاده از کسره اضافه
Named entity recognition is a process in which the people’s names, name of places (cities, countries, seas, etc.) and organizations (public and private companies, international institutions, etc.), date, currency and percentages in a text are identified. Named entity recognition plays an important role in many NLP tasks such as semantic role labeling, question answering, summarization, machine ...
متن کاملNamed Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011